Real-Time Classification of Twitter Trends
نویسندگان
چکیده
The community of users participating in social media tends to share about common interests at the same time, giving rise to what are known as social trends. A social trend reflects the voice of a large number of users which, for some reason, becomes popular in a specific moment. Through social trends, users therefore suggest that some occurrence of wide interest is taking place and subsequently triggering the trend. In this work, we explore the types of triggers that spark trends on the microblogging site Twitter, and introduce a typology that includes the following four types: news, ongoing events, memes, and commemoratives. While previous research has analyzed the characteristics of trending topics in a long term, we look instead at the earliest tweets that produce the trend, with the aim of categorizing trends early on and providing a filtered subset of trends to end users. We propose, analyze and experiment with a set of straightforward languageindependent features that rely on the social spread of the trends to discriminate among those types of trending topics. Our method provides an efficient way to immediately and accurately categorize trending topics without need of external data, enabling news organizations to track and discover breaking news in real-time, or quickly identify viral memes that might enrich marketing decisions, among others. The analysis of social features as observed in social trends also reveals social patterns associated with each type of trend, such as tweets related to ongoing events being shorter as many of the tweets were likely sent from mobile devices, or memes having more retweets originating from fewer users than for other kinds of
منابع مشابه
Design and Test of the Real-time Text mining dashboard for Twitter
One of today's major research trends in the field of information systems is the discovery of implicit knowledge hidden in dataset that is currently being produced at high speed, large volumes and with a wide variety of formats. Data with such features is called big data. Extracting, processing, and visualizing the huge amount of data, today has become one of the concerns of data science scholar...
متن کاملA High-Performance Model based on Ensembles for Twitter Sentiment Classification
Background and Objectives: Twitter Sentiment Classification is one of the most popular fields in information retrieval and text mining. Millions of people of the world intensity use social networks like Twitter. It supports users to publish tweets to tell what they are thinking about topics. There are numerous web sites built on the Internet presenting Twitter. The user can enter a sentiment ta...
متن کاملHip and trendy: Characterizing emerging trends on Twitter
Twitter, Facebook, and other related systems that we call social awareness streams are rapidly changing the information and communication dynamics of our society. These systems, where hundreds of millions of users share short messages in real time, expose the aggregate interests and attention of global and local communities. In particular, emerging temporal trends in these systems, especially t...
متن کاملA Study on Microblog and Search Engine User Behaviors: How Twitter Trending Topics Help Predict Google Hot Queries
Once every five minutes, Twitter publishes a list of trending topics by monitoring and analyzing tweets from its users. Similarly, Google makes available hourly a list of hot queries that have been issued to the search engine. We claim that social trends fired by Twitter may help explain and predict web trends derived from Google. Indeed, we argue that information flooding nearly real-time acro...
متن کاملPredicting election trends with Twitter: Hillary Clinton versus Donald Trump
Forecasting opinion trends from real-time social media is the long-standing goal of modern-day big-data analytics. Despite its importance, there has been no conclusive scientific evidence so far that social media activity can capture the opinion of the general population at large. Here we develop analytic tools combining statistical physics of complex networks, percolation theory, natural langu...
متن کاملA Latent Source Model for Nonparametric Time Series Classification
For classifying time series, a nearest-neighbor approach is widely used in practice with performance often competitive with or better than more elaborate methods such as neural networks, decision trees, and support vector machines. We develop theoretical justification for the effectiveness of nearest-neighbor-like classification of time series. Our guiding hypothesis is that in many application...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- JASIST
دوره 66 شماره
صفحات -
تاریخ انتشار 2015